Computer-Aided Inflection for Lexicography Controlled Lexica

Author

  • Maarten Janssen
Abstract

This article describes the design of a computational system for the development and maintenance of inflected lexica, developed as part of the Open Source Lexical Information Network (OSLIN). The system is built as a tool for lexicographers: it is flexible enough for them to deal with any irregularities in the language, and transparent enough for them to understand the rules used for the automatically generated inflections. It furthermore allows lexicographers to create and modify paradigm rules by themselves, making it easy to implement the system for any language, including less-resourced languages. Apart from the system itself, this article describes some of the challenges and obstacles that the design of such a system has to face, and the solutions adopted for them in the OSLIN framework.

Similar resources

Smart Paradigms and the Predictability and Complexity of Inflectional Morphology

Morphological lexica are often implemented on top of morphological paradigms, corresponding to different ways of building the full inflection table of a word. Computationally precise lexica may use hundreds of paradigms, and it can be hard for a lexicographer to choose among them. To automate this task, this paper introduces the notion of a smart paradigm. It is a metaparadigm, which inspects t...

Learning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica

In this paper we present preliminary work on the semi-automatic induction of inflectional paradigms from non-annotated corpora using the open-source tool Linguistica (Goldsmith 2001), which can be used without any prior knowledge of the language. The aim is to induce morphological information from corpora so as to compare languages and foresee the difficulty of developing morphosyntactic le...

Elsnet 97 Summer School, Lexicon Development for Language and Speech Processing: Draft Course Materials, Computational Lexicography for Speech and Language

This collection of course materials borrows from various materials, partly published and partly unpublished course materials. It is heterogeneous, uneven, and currently in very rough shape, but contains a variety of different kinds of material relevant to spoken language lexicography. The collection st...

Inflection points and singularities on planar rational cubic curve segments

We obtain the distribution of inflection points and singularities on a parametric rational cubic curve segment with the aid of Mathematica (A System for Doing Mathematics by Computer). The reciprocals of the magnitudes of the end slopes determine the occurrence of inflection points and singularities on the segment. Its use enables us to check whether the segment has inflection points or a...

Enriching Morphological Lexica through Unsupervised Derivational Rule Acquisition

In a morphological lexicon, each entry combines a lemma with a specific inflection class, often defined by a set of inflection rules. Therefore, such lexica usually give a satisfying account of inflectional operations. Derivational information, however, is usually badly covered. In this paper we introduce a novel approach for enriching morphological lexica with derivational links between entrie...

Publication date: 2011